CDS
Accession Number | TCMCG075C01380 |
gbkey | CDS |
Protein Id | XP_017972537.1 |
Location | join(7152498..7152785,7152935..7153046,7153209..7153383,7153749..7153758,7153871..7154089,7154402..7154677,7154827..7155126) |
Gene | LOC18611834 |
GeneID | 18611834 |
Organism | Theobroma cacao |
Protein
Length | 459aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018117048.1 |
Definition | PREDICTED: squalene monooxygenase [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | I |
Description | squalene |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction |
R02874
[VIEW IN KEGG] |
KEGG_rclass |
RC00201
[VIEW IN KEGG] |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] ko01000 [VIEW IN KEGG] |
KEGG_ko |
ko:K00511
[VIEW IN KEGG] |
EC |
1.14.14.17
[VIEW IN KEGG]
[VIEW IN INGREDIENT] |
KEGG_Pathway |
ko00100
[VIEW IN KEGG] ko00909 [VIEW IN KEGG] ko01100 [VIEW IN KEGG] ko01110 [VIEW IN KEGG] ko01130 [VIEW IN KEGG] map00100 [VIEW IN KEGG] map00909 [VIEW IN KEGG] map01100 [VIEW IN KEGG] map01110 [VIEW IN KEGG] map01130 [VIEW IN KEGG] |
GOs | - |
Sequence
CDS: ATGTGGCCTTTGCAAGCCTATAAAAGCCTCAAAGTTCAGTTGCTGGAAACACACACTAAACACAAAAACAATTCAGTTCTCGAGAAGATGGATTGCCAATATCTACTGGGAGCAATTCTGGCTTCTGTTTTGGGTCTTCTCTTGCTGTACAAGAAAATTCAGAAGCAGACATTTTCAATGAGCTCTGATAATGGATTACCCCGAGACTCAGAGATAGGTAGGAGTGCAGATATAATCATCGTTGGGGCAGGGGTTGCTGGTTCTGCTCTTGCTTACACTCTTGGGAAGGATGGACGCCAAGTGAAAGTGATTGAGAGAGACTTGACTGAACCTGATAGAATTGTTGGAGAACTTCTACAACCTGGAGGTTACCTTAAATTGATGGAGTTGGGGCTTGAGGAATGTGTGAATGGGATTGATGCTCAGCGCGTGTTTGGATATGCTCTGTTTAAGGATGGGAAAAGCACTAAACTATCATATCCCTTGCAACATTTCGAATCAGATGTTGCCGGGAGAAGCTTCCATAATGGACGTTTCATTCAAAGAATGAGAGAAAAAGCATCAACCCTTCCCAAGATCATACATGTTGACATCCCTCCTTGTTTTGTTGCTTTGGTCCTGGAGAACTGTCAGCTTCCATATGCAAATCATGGACACGTTATTTTGGCAGATCCTTCTCCAATCCTGTTTTATCCTATTAGTAGCACAGAGGTTCGTTGTCTGGTTGATGTACCTGGACAAAAGGTTCCCTCAGTCTCTAATGGCGAAATGTCCCATTTCTTGAAAACTGAGGTGGCTCCTCAGATCCCCTCTGAGCTATACAATACTTTCATATCTGCAATCAATAAGGGAAACATAAGAACGATGCCTAACAGAAGCATGCCTGCTTCTCCCTATCCAACTCCGGGTGCACTGTTGATGGGAGATGCATTTAACATGAGGCATCCTTTGACCGGTGGAGGAATGACTGTGGCACTCTCTGATATCGTCGTTTTAAGAGACCTCCTTAGACCTCTGCGCGATCTAAATGATGCATCTGCCTTGTGCAAGTATCTGGAATCCTTCTACACTCTACGTAAGCCAACAGCATCTACCATATATACACTAGCCGGTGCCCTCTATAAGGTGTTTAGCGGATCATCTGACCCAGCAAGGAGAGAAATGCGCAAAGCATGCTTCGATTATTTAAGTCTTGGCGGTGTTTTTTCAAATGGACCAATAGCTTTACTCTCTGGAATGAATCCTCGGCCATTGAACCTGATCCTACACTTCTTCGCTGTGGCTGTATATGGCTTTGGGCGCTTACTGCTTCCATTTCCTTCATTTAAACGCCTTCTACTTGGTGCTAGATTAATTTCGGCTGGTATCTTCATTGCTTAG |
Protein: MWPLQAYKSLKVQLLETHTKHKNNSVLEKMDCQYLLGAILASVLGLLLLYKKIQKQTFSMSSDNGLPRDSEIGRSADIIIVGAGVAGSALAYTLGKDGRQVKVIERDLTEPDRIVGELLQPGGYLKLMELGLEECVNGIDAQRVFGYALFKDGKSTKLSYPLQHFESDVAGRSFHNGRFIQRMREKASTLPKIIHVDIPPCFVALVLENCQLPYANHGHVILADPSPILFYPISSTEVRCLVDVPGQKVPSVSNGEMSHFLKTEVAPQIPSELYNTFISAINKGNIRTMPNRSMPASPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLRPLRDLNDASALCKYLESFYTLRKPTASTIYTLAGALYKVFSGSSDPARREMRKACFDYLSLGGVFSNGPIALLSGMNPRPLNLILHFFAVAVYGFGRLLLPFPSFKRLLLGARLISAGIFIA |